A Central Limit Theorem for the Zeroes of the Zeta Function 

Brad Rodgers 



Abstract. On the assumption of the Riemann hypothesis, we generalize a central limit theorem of 
Fujii regarding the number of zeroes of Riemann's Zeta function that lie in a mesoscopic interval. 
The result mirrors results of Soshnikov and others in random matrix theory. In an appendix we 
put forward some general theorems regarding our knowledge of the Zeta zeroes in the mesoscopic 
regime. 



1. Introduction 

This is an account of a mesoscopic central limit theorem for the number of zeroes of the Riemann 
Zeta function as counted by a (possibly) smoothed counting function. We assume the Riemann 
hypothesis (RH) throughout the note, although we discuss meaningful ways around this assumption 
in the conclusion. On RH, the zeroes of the Riemann Zeta function may be labeled | + 17, where 7 
is real. As is customary, we sometimes refer the 7's themselves as zeroes, at least where there is no 
confusion caused. Our concern is the statistical distribution of 7 near some large (random) height T. 

Notation: We will follow the conventions that e(x) = e l2 ' KX , /(£) = J e(— x ■ £)f(x)dx, g(x) — 
J e(x ■ £)<?(£) d£- I n addition we use the notations < g(x) and f(x) = 0(g(x)) interchangeably. 

If N(T) is the number of nontrivial zeroes in the upper half plane with height no more than T, then 
the number of zeroes N(t + h) — N(t) to occur in an interval [t, t + h] is expected to be roughly h-^- 
[30] . It was first shown by Fujii [9] that the ocsillation of this quantity is Gaussian, with a variance 
depending upon the number of zeroes expected to lie in the interval. 

Theorem 1 (Fujii's mesoscopic central limit theorem). Let n(T) be a fixed function tending to 
infinity as T — > 00 in such a way that n(T) — o(logT), and let Xt be a probability space with random 
variable t uniformly distributed on the interval [T, 2T] . Then, letting A — A(i, T) := N(t + ) — 

N(t), 

E Xr A = n(T) + o(l), 



Var Xr (A) - — log n(T), 



and in distribution 

A — EA 
VVarA 

as T — > 00 . 



TV (0,1) 



The main purpose of this note is to generalize Fujii's theorem in the following way: 
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Theorem 2 (A general mesoscopic central limit theorem). Let n(T) and Xt be as in Theorem\]\ 
For a fixed real valued function rj with compact support and bounded variation, define 



A, = A,(t,T)=]>>(^( 7 -i)), 



where the sum is over all zeros 7, counted with multiplicity. In the case that J \x\\r)(x)\ 2 dx diverges, 
we have 

E XT A n =n(T) [ n{t)dt + o(l), 



and in distribution 



■^/VarAjj 
as T — > 00 . 



Var Xr (A„)~ / \x\\f,(x)\ 2 dx 

^(T) 



^"^-^(0,1) 



It is a straightforward computation to see that Theorem [T] follows from Theorem [5] by letting 

V = l[-l/2,l/2]- 

We focus mainly on the proof of Theorem [2] in this note, but one can also prove, in essentially the 
same way, a mesoscopic central limit theorem for variances which converge: 

Theorem 3. For n(T) and Xt as in Theorem^ but r\ with compact support and bounded second 
derivative, the integral J \x\\f/(x)\ 2 dx is necessarily finite, but the conclusion of Theorem^ remains 
true even still. 



We call Theorems [TJ [2j and [3] 'mesoscopic' central limit theorems as they concern collections of 
n(T) zeroes which grow to infinity, but intervals whose length tends to all the same. 

On such mesoscopic intervals (averaged as in Theorems Q] and [2j) , all evidence points to the zeroes 
resembling points in a determinantal point process with sine kernel. In the microscopic regime (when 
n(T) = O(l)) this is known to be the case, provided we restrict our attention to sufficiently smooth 
test functions. (See Rudnick and Sarnak |23j or Hughes and Rudnick [14].) The techniques that follow 
allow us to recover these results for smooth test functions, as well as extend them to a mesoscopic 
regime, in a sense to be specified. We discuss these matters further in the appendix. 

For the moment, we may simply note the similarity of Theorems [T] and [2] to certain results in the 
theories of random matrices and determinantal point processes (for an introduction to the latter, see 
or the introduction of |27l): 



Theorem 4 (Costin and Lebowitz). Let X be a determinantal point process on K with sine kernel 
K(x,y) = s '"^^~^ , and A a count of the number of points lying in the interval [0,L]. Then 

E x A = L, 
Var x (A) ^ i-fogL 

and in distribution 

^£^(0,1) 
VVarA 

as L — > 00. 



In fact much more generally 27 , 
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Theorem 5 (Soshnikov) . For a family of determinantal point processes parameterized by a variable 
L, with Hermitian correlation kernels, if fh are bounded measurable functions with precompact support, 
define 

A/=]T/(^) 

where ((&j)) are the points of the point process. As long as Var^A^ — » oo, and 

sup\f L (x)\=0(V a vA fL Y, E L A| /U = 0((VarA /t ) 5 ) 
for all e > 0, and some 6 > 0, then in distribution, 

A/ t - EA /L 



v N(0,1), 

V Var s fL 



as L — > oo. 



A computation reveals that Soshnikov's theorem agrees with Theorem [5] for /t(x) = fix/L) for a 
sine kernel determinantal point process. In the case that the variance converges (or in the language 
of Theorem [2j |a;||?7(a;)| 2 is integrable), the analogous result was heuristically derived by Spohn [29j . 
and proved rigorously by Soshnikov in |28) . 

Although we obtain a partial analogue of this result in Theorem [3J we do not recover the full 
analogue. We should expect that Theorem [2] is true even in the case that J \x\ \r)(x)\ 2 dx converges 
with no more restrictions on rj than a bound on variation - this would encompass Theorem [3J - but 
in the latter theorem we require not only that this integral converge, but that it converge somewhat 
rapidly. Bounding an error term prevents us from accessing the results in between the two theorems, 
even though by analogy we should fully expect them to be true. 

Other similar results for the eigenvalues of unitary matrices were proved by Diaconis and Evans in 
[6]. using a perspective perhaps most akin to ours here. 

In fact, Fujii proved a more general result than Theorem [TJ encompassing macroscopic intervals as 
well. In order to state Fujii's result succinctly, we recall the definition 

S(t) :=argC(|+it), 

where argument is defined by a continuous rectangular path from 2 to 2 + it to i + it, beginning with 
arg2 = 0, and by upper semicontinuity in case this path passes through a zero. S(t), as it ends up, 
is small and oscillatory, and our interest in it derives from the fact that it appears as an error term 
in the zero counting function: 

(1) N(T) = i argr(i + if) - £ log^ + 1 + S(T). 

Theorem 6 (Fujii's macroscopic central limit theorem). Let Xt be as in Theorem^ and n(T) 
with logT < n(T) < T. Define A = S{t+ ^p-) - S(t). Then 

E Xt A = o(1), 
Varx T (A)~^loglogT, 

7T 

and in distribution 

—L=^N(0,1) 
VVarA 

as T — > oo . 



Note that in this case, if A is defined as before with respect to the function N(t), Ex T A does not 
have quite as nice an expression owing to the growth of the logarithm function. 
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In fact, it will in general prove preferable to work with S(t) in place of N(t) in the computations 
that follow. Differentiating ([T]), we have 

[d(d)-^]dt = dS(0, 

where 

d(0 -7), 

7 

with the sum over zeroes counted with multiplicity, and 

Using the moment method and Stirling's formula 0, Theorem [5] will be implied by 

Theorem 7. For rj a real-valued function with compact support and bounded variation, for n(T) — > 
oo as T — > oo in such a way that n(T) = o(logT), 



T 



it r 



T 



k 



dt = (c k + o(l)) 



n{T) -| fc/2 

|a;||r}(x)| 2 dx 

-n(T) 



provided the integral on the right diverges. Here ci :— (£ — 1)!! for even I, and eg := for odd I, are 
the moments of a standard normal variable. 

Theorem [3] will follow from the above statement where rj is instead restricted as in Theorem [3l 

In order to prove his results, Fujii made use of the moment method, and the following approximation 
due to Selberg [24].[25]. 

r2T 



1 r [s(t) E fiMW,,^ 

1 Jt l p < Tl/k VP J 



T , 

which Selberg had used earlier to derive a more global central limit theorem for S(t), 

i I |5(t)| 2 ^^^(loglogr) fc . 

These formulas are sufficient to prove Theorem [5] for test functions 77 which are sums of a finite 
number of indicator functions. They break down, however, in an attempt to prove the theorem for 
general ?y, since, although one can approximate rj by simple functions, the error terms thus generated 
rapidly overwhelm the main terms of the moments. 

Our approach, roughly stated, is a sort of weak analogue of Selberg's and Fujii's. In this, we follow 
the derivation [14] of Hughes and Rudnick of mock-gaussian behavior in the microscopic regime with 
respect to sufficiently smooth test functions. We extend these computations to the mesoscopic regime, 
still requiring smoothness, but the key point which allows us to obtain our central limit theorem is 
that any test function will become sufficiently smooth when dilated as they are in the central limit 
theorems [2] and El This is one clarifying feature of our proof. The proof of Fujii's theorem making 
use of Selberg's approximation for S(t) leaves the link between this central limit theorem and the 
microscopic determinantal structure of the zeroes somewhat mysterious. 

This approach, with slightly more work, can be used to produce Fujii's Theorem|6]as well, although 
in this case an analogue of Theorem [2] would appear to be less satisfying. We shall not prove so in this 
note, but in the macroscopic case already if 77 is so much as absolutely continuous, the variance and 
higher moments of (defined in the obvious way) tend to 0. This is a feature of the rigidity of the 
distribution of zeroes at this regime, which while not quite as rigid as a clock distribution (see 



^Stirling's formula implies that ^ = IgiM^) + 
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for a definition), resemble at this level this distribution perhaps somewhat more than they do a sine 
kernel determinantal point process. One should compare this analogy with the classical theorems that 
for a fixed h, N(t + h) —N(t) x logi for all sufficiently large t, with constants depending upon h. (Sec 
[30j . Theorems 9.2 and 9.f4.) In this regime, arithmetic factors play a heavy explicit role; this will be 
implicitly evident in the proof that follows. In this, we can recover the heuristic observations of Berry 
PQ regarding the origin for the variance terms in Fujii's theorems. Indeed, one can already discern, 
by comparing Fujii's central limit theorem to the central limit theorem of Costin and Lebowitz, that 
the statistics of the zeros in this regime cannot be modeled too closely by a sine-kernel determinantal 
process. Outside of the mesoscopic regime, these statistics demonstrate an important 'resurgence 
phenomenon' discovered heuristically by Bogolmony and Keating, and explored in [3], [19] , |21j and 



Zeev Rudnick pointed out to the author that he had used similar ideas with Faifman in [5] to prove 
a Fujii-type central limit theorem, for counting functions with a strict cutoff, in the finite field setting. 

One can apply these ideas to get central limit theorems as well for the number of low-lying zeroes 
of L(s,Xd)i where \d ranges over the family of primitive quadratic characters, by extending the 
microscopic statistics of Rubenstein [22 . Details will appear elsewhere. 

While a version of this note was in preparation, a preprint [4] of Bourgade and Kuan was uploaded 
which independently proves Theorem [3J along with its aforementioned analogue for converging vari- 
ances. They proceed by a slightly different method which makes use of the Hclffer-Sjostrand functional 
calculus and a formula of Selberg. Their conditions on admissible test functions differ slightly as well; 
in particular they state a result which in its technical range of test functions is more general than 
Theorem [3j The method of either paper could be extended to yield a more general result, somewhat 
messier in its formulation. This matter is discussed in somewhat greater detail in the proof of Theo- 
rem [3] It would appear that the methods in the two approaches can be pushed in slightly different 
directions, so that neither method properly contains all results which can be produced by the other. 

Tantalizingly, however, either method seems to include a class of test functions - lying within the 
class of test functions with converging variance - for which a central limit theorem, by analogy with 
random matrix theory, ought to be true, but for which we have no proof. This matter is discussed 
further in the appendix. 



This section consists mainly in minor quantitative refinements in the argument of Hughes and 
Rudnick [14] . In turn, their argument is similar to Selberg's in making use of the fundamental 
theorem of arithmetic to evaluate certain integrals. Our main tool in what follows will be the well 
known explicit formula relating the zeroes of the Zeta function to the primes. 

Theorem 8 (The explicit formula). For g a measurable function such that g(x) — g \ x+ )+ g \ x ~) } 
and for some 6 > 0, 



2. Local Limit Theorems for Smooth Test Functions 



(a) 





we have 




where here ip(x) — J2n<x^( n )> f or the von Mangoldt function A. 
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The integral on the left denotes a principle value integral, lim^^tx, J_ L - In what follows we will 
frequently work with test functions for which the distinction between this principle value integral and 
an ordinary integral disappears, and where this is the case we will cease to make one in notation. 

Written in this way, the explicit formula is true only on the Riemann hypothesis. It is due in varying 
stages to Riemann [20] . Guinand [11] . and Weil [31] . and expresses a Fourier duality between the 
error term in the prime number theorem and the error term for of the zero-counting function. 

Without the Riemann hypothesis, we must write the left hand side as 

h\<L J - L 

where our sum is over 7 (possibly complex) such that h + ij is a nontrivial zero of the zeta function, 
It is proven by a simple contour integration argument, making use of the the reflection formula to 
evaluate one-half of the contour. (For a proof, see [15] or [18] .) 

We will also need the following corollary of the prime number theorem. 

Lemma 9 (A prime number asymptotic). For f compact with bounded second derivative, 

P) s H*f(¥) - °( " /IU + + " r '" ) + f «<*»•■ 



PROOF. That something like this is true is evident from the prime number theorem (or even 
Chebyshev), but some formal care is required to get the desired error term. We will need that, 

E ]M£. l0g „ + c + O(l ^, 

for some constant C, which is a formula on the level of the prime number theorem (and can be proven 
from the prime number theorem with a strong error term using partial summation.) 



We have then, using the abbreviation F(x) — xf(x), 



1 log 2 p flogp 



H 2 



H 



H 



=0 



p ^ log n \ _ ^ log(n+l) ' 



II/II00 + 11/11 
H 2 



E 



logn-log(n + l) logn w logr , 
H H { H 



by partial summation and the mean value theorem. Again using the mean value theorem, this time 
to approximate an integral, we have that this expression is 



O 



ii/iico + iiriioo + iiriic 

H 2 



+ / xF'(x)dx, 



which upon integrating by parts is the right hand side of ([2]). 



□ 



In what follows instead of working with the average A J T we work with smooth averages J a(t/T)/T 
for bump functions a. What we will show is that 

Theorem 10. For rj as in Theorem [7J and a non-negative of mass 1 such that a has compact 
support and cr(£) log fc (|i| + 2) is integrable, 



a{t/T) 



T 



™<T) -, fe/2 

|x||?7(a;)| dx 

-n(T) 
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We will show that this implies Theorem[7]at the end of Section 4. We have a computational lemma. 

Lemma 11. Given non-negative integrable a of mass 1 such that a has compact support, and inte- 
grable functions 771, 172, ••• ,Vk such that supp^ C [— Si, St] with Si + §2 H — • + Sk = A < 2. For large 
enough T (depending on A and the the region in which a is supported), 

(3) 



/. ^ S (£ * (¥(& - «»«<«) * n 



logT 



logT 

e n^(^). 



where the sum is over all n £ N k , e E {— 1, l} k such that n^i 



k =l i=\ 
1. 



PROOF. By the explicit formula, the right hand side of ([3]) is 



T VfJri-oologT 



1 



E 



log fe T 



<=1 



We can expand the product Y[e~ x '^ 2 d(e Xe — ip(e Xe )) into a sum of signed terms of the sort d@i(xi) ■ 
■ ■ dpk(xk), where df3e(x) is either e x l 2 dx or e~ x / 2 di/j(e x ). In the case that at least one dpj in our 
product is e x ' 2 dx we have 



log fc T 



< 



TO I 



so that in this case 



a;*;)) 



log fc T 



< 



Tlog^T 



_rp5j/2 



< 



Tlog fc T 

T A / 2 TT ||% 



n 



logT 



Into such error terms we can absorb all products d/3\ ■ ■ ■ dfik except that product made exclusively 
of prime counting measures, namely (— l) k Y[ e~ x ^ 2 dtp(e xe ). Evaluating the integral of this product 
measure we have that the left hand side of (JSJ is 



n 



T l-A/2 11 bgT y \logT 



E E ^(-^(eilogni- 

eG{-l,l} fe n£N fe 



- efc logn fc ))n^(^)- 



Note that if |ei logni 



log 2 



• • + e fc logn fe | is not 0, it is greater than | log(l - l/y/h~i ■ ■ ■ n k )\ > ^ ni ... nk 

since rii is always an integer. As ■ s /n\ ■ ■ ■ < T A / 2 = o(T) and a has compact support, for large 
enough T our sum is over only those e, n such that e\ log m + • • • + ek log rife = 0. □ 



Finally, we can use our prime number asymptotic, Lemma [5J to obtain 
Lemma 12. ForU\,...,Uf. with bounded second derivative 



(4) 



1 

w 



71. "-n 



; fe=! £=1 



f^)=%+ e s,-o*(n 

0CJC[fc] 



II 00 



II 00 
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where \k] — {1, k\ and here for a set J we define 

where the sum is over all partitions of J into disjoint pairs {i\,j\\- 
Said another way, 

1/2 



Sj = ^2 I!! / \ x \ u t{x)u 7T ( g) {~x)dx 



reC(J)l 

where the set C(J) is null for \ J\ odd, and for \ J\ even is the set of (| J| — 1)!! permutations of J whose 
cycle type is of \J\/2 disjoint 2-cycles. 



Proof. By Lemma[!jJ for any i,j, 



x\ui(x)uj(— x) dx + O 



(5) = / \x\ui(x)uj(~ x) dx + O 



\\UiUjl\oo + || || OO + \\(UiUj)"\ 



H 2 

k\\oo + IK||oo + \\u"\\oo 



H 



ij\\oo + II^Hoo + Halloo 



H 



where the initial sum is over all primes pi,j>2 and signs e\, 62 with p^vi = !• 
It follows that 

1 _ " 1 0KP/ / e „l gp^\ ~ « r\ ( TT 11^11°° 1 1 ^ 1 1 00 

W ^ ii-y^^[—s-) [fcl + ^-^lll # 

pn... p^lfcl V" 0C,7C[fc] V ££J 

as, by the fundamental theorem of arithmetic, p^ 1 ■ ■ ■ p^ — 1 if and only if primes match up pairwise 
Pi = pj with ej = —6j. The error term listed accumulates by expanding those products in which terms 
of the sort ([5]) occur. 



It remains to show that 



logpA _ a n ( TT W U i\\oo + Halloo + IKI 



II 

00 



(6) ^ e n^( £ ^)= e ^.o fc (n 

Pi Pfc — 1 

where the sum is over primes pi,...,pk, signs ei,...,efc, and positive integers (Ai,..., Afc) € \ 
{(1, 1, 1)}. But the left hand side sum of ([6]) restricted to A with Ai > 3, A& > 3 is plainly 

fc 11 1 



H 



=1 

On the other hand, for A with Xj fixed to equal 2 for some j, by the fundamental theorem of arithmetic 

Pi lAl ' ' ' Pk° Xh ~ 1 onr y m * ne case that pj = pj' for some j' 7^ j, so that thus restricted left hand side 
sum of (J6j) is 

ic[fc] v Pj J ' p eei ^ 

where the sum with index labeled p is over p,X,e such that FUei^ = 1j an( i ^ ^ as the function 
in this sum of collecting those Pi which are not equal to Pj. This expression is unpleasant, but our 
consolation is that it is only an error term. Applying it inductively, to bound the sums restricted to 
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Yli eI p e / Xe — 1, yields the Lemma. (We have here fixed \ 3 ; = 2, but of course to get an upper bound 
we need add at most k sums like this.) □ 

As a consequence of Lemmas [TTJ and [121 with H — > 
Corollary 13. For n and a as in Lemma XTTS. 



Ti-A/2 11 log T/n(T) 

1=1 6 ' V 0CJC[/c] 

where Sj is defined as in Lemma with ut = rjg. 



log T/n(T) 



Remark: As an aside, we note that by modifying the above analysis, making A small enough, one 
can obtain an asymptotic even in the case that n(T) grows like 0(T 1-<5 ), for S > 0. In this case 
the result is less elegant, since the arithmetic factors present in Lemma [TT] do not smooth out in the 
final asymptotic. We do not pursue these computations here, but they can be used to recover Fujii's 
macroscopic result, Theorem [5] 

From Corollary [TBI it is an easy computation to see that 

Lemma 14. For r\, a and n{T) as in TheoremVQ with r),cr, and k fixed 

a{t/T) 



(7) 



T 



-i k 



dt = (c k + o(l)) 



\x\\fj(x)\ 2 dx , 

-n(T) 



provided that K is chosen based on k to have sufficiently compact support. 

PROOF. Note that [K n{T) * v(^m)T(0 = n(T)K(£)fj(n(T)(,). By Corollary [lj for K chosen to 
be supported in (— 1/k, 1/k) we have the left hand side of is 

k/2 



(Cfc + 0(1)) 



K 2 



(^)W-W(-)! 5 



dx 



Because r\ is of bounded variation, f]{x) — 0{l/x), and for any c\ > C2 > 0, 

c 2 n(T) / ,n(T) 

2j " * A - ' |a;||r)(a;)| 2 dx ), 



|a;||77(aj)p<2a: < log(ci/c 2 ) = o 

cin(T) \J-n(T) 



since this latter integral diverges. As we have that when x — ¥ 0, if 2 (x) = 1 + o(l), 



(^>H<K*)I S 



n(T) 



»(T) 



|x||f)(a;)| 2 dx. 



□ 



3. An Upper Bound 

We will be able to complete the proofs of our central limit theorems by showing that the left hand 
side of ([7]) is a good approximation to the left hand side of the equation in Theorem[7] We accomplish 
this mainly through the use of the following upper bound 



2 Even in the case it converges this o-bound is true, albeit for a different reason. 
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Theorem 15. For a as in Lemma XTlS. 



(8) 



a(t/T) 



T 



n(fK-iW« 



dt < fc 



T 



with 



L/=-0O 

where for typographical reasons we have denoted the interval [kv — k/2,kv + k/2) by Ikiy), and the 
order of our bound depends upon k, \\a\\ and the region in which a can be supported. 

PROOF. We make use of the Fourier pair V(£) = (^jr-) 2 and V(x) = (1 - |a;|) + . Note that 

The right hand side of this is similar to M k r] and we denote it by M' k rj. What is important about the 
scaling is that V Vt k is supported in (— l/k, l/k). Note that the left hand side of (O is bound by 

-I k 



< 



T 



dt 



where 



.4 : 



B 



< [A 1/k + B 1/k ] k , 

°{t/T) \ r°° MLv{ ^_ t))dS(0 

J — oo 

/oo 
M^(^(e-t))log(|C|+2)dg 
-oo 



T 

a(t/T) 



dt, 



T 



dt. 



by Minkowski, and the fact that £l(£)/2w = 0(log(|£| + 2)). 

By the restricted range of support for V u> i and Lemmas [TT1 and [T2l for integers v-y, v k 

I n ( fl ^(^r^ - t))ds(^yt = o k (i). 



Whence, taking a multilinear sum, 



as log(|£| + 2)>l. 
Finally, 



A ~ k IE sup w 

< B 



oo 1 

^— ' 1 + u 

fi——oo 



so using log(|f + ^1 + 2) < log(|e| + 2) log(| M | + 2), 



B < 



T 



M k n{^f{Z-t))-\og{\t\ + 2)di 



dt. 



These estimates on A and B give us the result. 



□ 
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This result should be viewed as a slight generalization of an Oa(1) upper bound given by Fujii for 
the average number of zeros in an interval [t, t + A/ logT] where t ranges from T to 2T [9]. 



4. Proof of Theorems [2] and [3] 

We are now finally in a position to prove our main results. We first prove Theorem 1101 then pass 
to Theorem [7] (and hence to Theorem [2]) . 



Proof of Theorem [TOj We want to show that 

i k 

logT 



En 



a(t/T) 



T 



dt 



is asymptotically negligible. In part because k can be odd, we must use some care. To this end we 
have the following lemma. 

Lemma 16. For (X,dfj,) a measure space, f, g real valued functions on X, and k > 1 an integer 



\f-g\\ 



L k {dn) 



{WfW k L 



L k ldii) + llflli*^)) 



Proof. If f k and g k are both almost everywhere the same sign, this is implied by Minkowski 
(with implied constant k). On the other hand, if f k and g k are almost always of opposite sign, 
the estimate is trivial. We can prove the lemma in general by breaking the integral over X into 
two integrals over these subcases, and combine our estimates by noting that for positive a and b, 
a a + b a < 2max(a Q ,6 Q ) < (a + b) a , where (in our case) a = (k - l)/k. □ 



(9) 



This leads us to consider 

a(t/T) 



T 



f (v-K n (T)*v)(^^-t))dS(0 

J — CO 



dt. 



which by Theorem [15] is bound by 



< 



/ M k/n{T) ( V -K n(T) *r 1 )(^^(Z-t))-\og(\t;\+2)dfi dt 

J — OO 

2g£ / M kMT) ( V -k niT) * V )®]og(\t+?$P-Z\+2)dt 



< 



T 
T 

a{t/T)\og k {\t\+2) 
T \og k T 

2nn{T) 



i k 



dt 



dt 



-i k 



n(T) / M fc/n(T) (77 - K n{T) * Tj)(0de 



logT 



M k/n{T) (rj - K n[T) * Mlfl + 2R 



Note, if we label L{£) = log(|^| + 2), we have M k/n{T) ( V - K n(T) * v )(t) log(|C| + 2) < M k/n(T) [(n - 

&nlT)*V)L] (0- 

At this point we make use of the fact that rj is of bounded variation. Because rj has compact 
support, 



J log(|£| +2)\d V (0\ 



< +oo. 
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In addition, K n (r) * i] is bounded in variation for the same reason that 

J log(|€|+2)|d^ n(T) *» 7 (0|=A'(0) J log(|£| + 2)|^(0| < +00. 

By the product rule then, var[(?7 — K n (T) * w)L\ is bounded, for var(-) the total variation. 
We have the following three lemmas: 

Lemma 17. For f G L 1 (M) and of bounded variation var(/), and K as above, 

\\f-*H*f\\ L i<^. 

The proof is utterly standard, but I was unable to find a reference. The key point is that K is 
smooth and compact, so that |x||i\"(x)| is integrable. 

Proof. Note that K H {x) = HK(Hx), so 
\\f-k H *f\\ L i = 



j HK(Hr)f(t) dr- J HK(Hr)f(t — r) dr 
H j K(Hr)\\f(t)-f(t-T)\\ Ll{dt) dr 
H J K{Ht)( J J \df(t + h)\dhdt)dr 
= H J k{Hr)\T\dT -var(/) 



< 



< 



< var(/) 
~ H ' 

□ 

Likewise, because |if(x)||x| 2 is integrable, and |AT(x)| |x| log(|x| + 2) is of order |_ftT(x)||x| around 
x = and is bound up to a constant by |K(a:)||x| 2 otherwise, we have similarly, 



Lemma 18. 



\\f-K H *f\\L^ m+ 2)dt)<^ I log(|i|+2)|#(t)|. 



Finally, 

Lemma 19. For f of bounded variation, and any e > 0, 



E £ ll/IU( £[fe _ 1/2 , fe+1/2) ) SII/IU* +-var(/). 



k— — 00 



PROOF. For arbitrarily small e', we can choose xu € e[k— 1/2, fc+1/2) so that |/(xfc)l * s sufficiently 
close to II f I / , , , s \ that 

" J n L=°(£[fc-l/2,fc+l/2)j 

00 

E e ii/n^( £[fc -i Afe+ i/2))^ e '+ £ Ei/^)i 

< e' + ^(x 2j+ 2 - x 2j )\f(x 2j )\ + ^( x 2j'+i - ^2j'-i)|/(x 2 j-i)|. 



k——oo 
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More, 



\m\ - I/Mi 

%2j + 2 



dx 



W(x)\ 



\f\dx - Y^(x 2 j+2 ~ %2j)\f(X2j)\ < 

3 3 

< ^2(x 2j + 2 ~X2j) I 

i Jx - 

< 3e • var(/) 

as (x2j+2 — %2j) < 3e always. The same estimate holds for a sum over odd indices, and we have then 
Veil/11 (< , , <\ <e' + 6£-var(/) + 2 / \f\dx. 

Z-^i ,]J " L°°[e[k-l/2,k+l/2)) ~ w ' J 

k 

As e' was arbitrary, the lemma follows. 



□ 



Making use of these lemmas we have that 



M k / n ( T ){rj - K n ( T ) * v)(C)d£, < v ,k 



1 



and 



M k/n{T) [(r) - K n{T) * if) • L] < rhk 



n(T)' 
1 



n(T) 



(10) 



Hence © is bound. By Lemma UM 

a(t/T) 



T 

a(t/T) 



(fc-i)/fc 



dt 



T 



(ft-l)/fc 



dt 



For k even, this implies by Lemma 1141 (our Fourier truncation central limit theorem), and the fact 
that J \x\ \fj\ 2 dx = +00, 

k _ rn(T) 



v(t/T) 
T 



dt={c k +o{\)) 

v(t/T) 



n(T) 



|a;||f)(a;)| dx 



fc/2 



o 



T 



(fe-i)/fe-i 



dt 



This bound implies the left hand side diverges, and thus the conclusion of Theorem [TO] for even k. 
For odd k, by Holder (or Cauchy-Schwartz), 



(11) 



a(t/T) 



T 



v(£$i(Z-t))dS(0 



dt< (V^ + o(l)) 



n(T) 



i(T) 



|x||f)(a;)| 2 dx 



fe/2 



and hence, using (fTOj) again, Theorem ITO1 for odd k as well. 



□ 



PROOF of Theorem [JJ To see that Theorem [POl implies Theorem[7J note that for any e > 0, we 
can find <j\ of the sort delimited in Theorem llOl so that ||l[i 2] — ciIIl 1 < e /2- Further, we can find a%, 
a linear combination of translations and dilations of the function ( sl " f 7rt ) 2 , so that 02 is no n- negative 
and |l[i,2](i) — cxx (i) | < U2it) for all t, and 1 1 cr 2 1 1 ^ 1 < £• Note (for simplicity of notation) that (fTTj) is 
true for even k as well, and by rescaling linearly, we have 



l^f 1 £j(^-t))dS(0 



dt < e(V^+o(l)) 



n(T) 



i(T) 



\x\\fj(x)\ 2 dx 



fe/2 
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Then 

1[1,2](*/T) 

T 



ri(£m(t-t))dS® 



dt= [cfc+o(l)+e.(O fc (l)+o(l))] 



n(T) 



n(T) 



|x||r)(x)| dx 



fc/2 



(Note that here the Ofc(l) term is bound absolutely by ^/olk.) As e is arbitrary, the theorem follows. 

□ 



Proof of Theorem [3J A proof will follow almost exactly as before. We need only to show that 
Theorem [7] is true for r\ instead of the sort delimited in Theorem [3] The reader may check that the 
only part of the proof which requires modification is that the error term Et, at the start of section 4, 
cannot be shown to be asymptotically negligible in the same way as before, since now asymptotically 
negligible means that Et = o(l). But using Lemma 1161 in the same way as before, this will be the 
case, and therefore the theorem, so long as 

(12) \\r)-K H *ri\\ L i=o(l/H), 

for some K as above. This is certainly the case for those r\ delimited in Theorem [2] using the fact 
that for such 77, 17(f) = o(l/(l + |f|) 2 ). □ 



Remark: (IT2l is true for a wider range of functions than C C 2 (K); but it does not encompass the 
elegant criterion, "all functions which are of bounded variation and compactly supported." It is not 
the case for rj a Cantor function, for instance. We expect the theorem to remain true in this case, 
but to prove this would seem to require upper bounds on correlation functions for Zeta zeroes with 
respect to oscillatory functions, extending outside the range of functions considered by Rudnick and 
Sarnak. Although here we require only upper bounds, not exact evaluations, this still goes beyond 
what we currently seem able to prove. 



5. Appendix: Towards a Mesoscopic Theory 



We include in this appendix a more general discussion of the statistics of the zeroes of the Zeta 
function in the mesoscopic regime. Our discussion will culminate in Theorem (2U a statement from 
which one can deduce both the microscopic linear statistics of the sort considered by Rudnick and 
Sarnak and the central limit theorems discussed above, along with covariance statements for translated 
linear statistics separated by mesoscopic distances. 

To motivate what follows, we want to show first that Corollary [T3] implies the well-known result of 
Rudnick and Sarnak that, upon ordering the positive ordinates of the zeroes < 71 < 72 < 

Theorem 20 (Rudnick-Sarnak). For 17(61, ...,£&) such thatsuppi) C {x £ R k : \x\\-\ \-\xk\ < 2}, 

lim i r ri(^f(7i 1 -t),...,^(7 ik -t))dt= j V (x) det[S( Xi - Xj )]d k x, 

ii,...,i k 
distinct 

where 5(f) = si °^ and detkxk[S(£i — fj)] is a k x k determinant with ij th entry S(£i — £,-). 



That is to say, with respect to sufficiently smooth functions, the zeroes of the Zeta function tend 
weakly to a determinantal point process with sine-kernel. 

One may do this either through a combinatorial sieving procedure - effectively this is the proof of 
Rudnick and Sarnak - or alternatively one may use the combinatorics of Diaconis and Shahshahani. 
For us, it will be more enlightening to use the latter. Proceeding in this manner originated with 
Hughes and Rudnick, although our range of test functions will coincide with the slightly wider range 
used originally by Rudnick and Sarnak. 
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The theorem of Diaconis and Shahshahani we will need is 

Theorem 21 (Diaconis-Shahshahani) . LetlA(n) be the set ofnxn unitary matrices endowed with 
Haar measure. Consider a = (a\, ...,ak) and b — (pi,...,bk) with a\, aa, 61, 62! ••• € {0,1,...}.// 



(13) 

Furthermore, in the case that 



[ T[Trtf) a >Tr(jgj) b idg = 0. 
Mn) fJl 



(fc fc \ 

3=1 j=l / 

we have 



(14) 



/ n TttfY'Ttwy* 'in = Sab n / , ". ; 

■/«(») 7=1 *=i 



In addition, for unrestricted 



fc fc 

/ niw)i 2a3 '^<n^ 

Mn) ,- = 1 7 = 1 



but we will not need this fact. In general, for products of traces outside of the restricted range of the 
theorem, no pattern emerges which is as nice as (fT4")) . Since our restricted range here corresponds - as 
we will show shortly - to the only range of test functions for which the statistics of the Zeta function's 
zeroes can be rigorously evaluated, this fact must be seen as somewhat curious. 

Here trace is defined in the standard way, so that Tr(/„ xn ) = n. For a proof of Theorem [21] see 
M or [2]. 

It is a simple exercise in enumerative combinatorics to see that (TT3]) and (Tl4]) imply that for \ji | + 
••• + |jfc|<2n 

fc 



)U(n) e=1 a 

where once again the sum is over all partitions of [k] into disjoint pairs {fi\,v\}, and <5(j MA = —jv x ) 
is 1 or depending upon whether j MA = — j„ A or not. 

We are able to use this to study the determinantal point process with sine kernel because the 
eigenvalues of a random unitary matrix, properly spaced, are themselves a determinantal point process 
with kernel tending to that of the sine kernel. This is due, in effect, to Weyl. 

Proposition 22. Let {e(#i), e(02), e(9 n )} be the eigenvalues of a random unitary matrix, dis- 
tributed according to Haar measure, with 9i G [—1/2,1/2) for all i. Then the points {n0\, n9 n } 
comprise a determinantal point process S n on [— n/2,m/2) with kernel in x,y given by S n (x — y) — 
— P"? 7r / X "~ 2/ A ; . That is for any test function n, 

n sm(7r{x — y) /nj j a j n 



E s n ^C&i >-'&*) = / Y r t {n9 il ,...,n9 ik )dg 



JU(n) , 

distinct distinct 



n(xi, ...,x k ) det[S n (xi - Xj)] d'x 

-n/2,n/2] k kxk 
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For further discussion see [5]. 
We use this to prove 

Theorem 23. If S is the determinantal point process with kernel in x, y given by S(x—y) for S(x) = 

m ^ x , then for functions r]i, such that, as in Lemma llll supple [— 8i,5(\ withSi-\ h£fc < 2, 

fe 

(15) E 5 ]J (A m - E S A % ) = S [k] 

where S\u is defined as in Corollary [721 an d here A v = X^te) as before, for {^} the points of the 
process. 

Note that here, by definition, EA,, = J ndx. 

Before we come to the proof, we note that as an easy consequence, upon expanding the product in 
1151 and applying induction, 



Corollary 24. A point process V satisfies (| 15[) for all k over the range of test functions restricted 
as in Theorem \23\ if and only if for all k and for any integrable r\ defined on WL k with supp eta C {y g 

R k ■■\yi\ + --- + \y k \<2}, 

E s V •»,&„)= / V( x i, -,Xk) det[S{x l - Xj)]d k x. 

' J R k fexfe 

ii,...,i k 

distinct 



PROOF of Theorem [23J For a function n, define 

n {n) (9) = ^n(Q + nk). 
fcez 

Note that for Schwartz 77, r)( n > — > 77 uniformly We have then that for fixed Schwartz 771, rjk, 

fe k 

E S J] (A % - E S A % ) = Jim E S „ [] (V> ~ Es »\ ( "0 
e=i t=\ 

[■ k , n .1/2 \ 

- L , II (E^(^) - » 7 4 n \n0)de\ dg 
JU{n) e=1 \ v —i J-1/2 / 

But by Poisson summation, 

rl/2 



- jez\{0} 



so that 

/ H E ^ (n) w - n / ^ de)dg= j]E -fc (-) W) d 3 

But for 771, 77fe restricted as in the Theorem, this sum is only over those j with |^-|H — ' + lirl — 2- 
In this case the above sum reduces to 

eii( e IMm?)> 

Clearly this tends to Sru . □ 
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PROOF of Theorem l20l Using Corollary IT3l for n(T) — 1, we have for r)i,...,T]e as in Theorem 

E3 

But by Stirling's formula, 

/~ K^te ~ *)) = E ^(^(-r - *)) - / ^ + (!)- 

Expanding the product as in Corollary 1241 and passing from a to In. 2] as before yeilds the claim for 
T) = r\x ® • • • ® f]k- We can pass to general r\ by uniformly approximating such r\ and using Theorem 
1151 to bound the difference between the linear statistics of 77 and those of its approximation. □ 



The convergence here is microscopic, and therefore cannot, unless spread over a wider region as in 
Corollary [231 yield a mesoscopic central limit theorem like Fujii's or Theorem^ In a general way, it 
does appear that in the mesoscopic regime, the zeroes of the Zeta functions are spaced like the points 
of a sine-kernel determinantal point process - and that moreover we have knowledge of this fact as 
long as any test functions used remain microscopically band-limited (or quite close to it). Stating this 
principle in a way which is both (i) precise, and (ii) satisfying, is a rough task however. We shall make 
an attempt below, but we should be forthright that it is only the first of these conditions and not the 
second that is really achieved. Before proceeding, it is worthwhile to discuss the matter heuristically 
somewhat further. 

We say that a point process V is "mock-determinantal with sine-kernel" if its correlation functions 
agree with that of S with respect to sufficiently smooth test functions; that is 



lX,...,lh »!,...,** 

distinct distinct 



with respect to - say for our purposes - 77 with fj supported on {x £ K fc : |xi|H \-\xk\ < 2}. Using the 

proof above for the zeroes of the Zeta function, one can show that Theorems [2] and [3] hold for any such 
V. That is for rj restricted as in either theorem, a parameter L which grows, and A,, = ^2r](£i/L), 

V EA "^(0,1), 
V'VaTA; 

as L — > 00. (As here we are dealing with a single point process V , 'mesoscopic' restrictions on the 
growth of L play no role.) We may ask whether there exists any such mock-determinantal point 
processes V for which 77 is of bounded variation, but (A^ — EA J( )/ ^/VarA r) does not tend to the 
normal distribution. I do not know the answer to this, but I suspect that there does. This would 
imply that to fill the small gap between Theorems [2] and [3] and their random matrix analogues will 
require (a small amount of) statistical information about the zeroes of the zeta function outside of 
that provided by test functions which are band-limited as in Rudnick-Sarnak. 

We return to our goal of characterizing the zeroes of the Zeta function in the mesoscopic regime in 
a way that retains microscopic statistics as well. We have: 

Theorem 25. Let a be as in Theorem \1(A and let Zt be the point process defined by the points 
{ ^°2^ (7 ~ t)} where 7 runs through the ordinates of zeroes of the Zeta function, and t is a random 
variable in K with distribution given by o~(t/T) /T. For fixed A < 2, fixed r of compact Fourier support, 
and fixed n(T) with n(T) — > go but with n{T) = o(logT), we have that for |cki ! + ■•• + \a>k\ < A, 

fe k 

E Zt [J (A, - EA<) = E s H (A, - EA,) + ]T e([k}\ J) ■ E s J[ (a, - EA £ ), 

£=1 i=\ 0cjc[fc] eeJ 



18 



BRAD RODGERS 



where 

for the terms e([k]\J) having no dependence on on with i G J, and tending to uniformly as T — > oo. 

This may be proven by following exactly the proof of Corollary [T3] By slightly modifying the proof, 
one may prove this theorem even for a = l[i.2] so that t is uniformly distributed between T and 2T, but 
we do not pursue this matter here. By integrating in a, one can obtain microscopic and macroscopic 
statistics, and correlations thereof, uniformly for points separated by a distance asymptotically less 
than m(T). One can, for instance, recover Corollary [T3l for n(T) — o(m(T)) in this way. We are able 
to integrate in a without destroying error terms for the reason that s([k]\J) has no dependence on 
on for i £ J. 

In the same way, by modifying the proof of Theorem 1231 

Theorem 26. For fixed A < 2, fixed r of compact Fourier support, and fixed n(N) with n(N) — > oo 
but with n(N) = o(N), we have that for \ai \ + • • • + \ctk\ < A, 

fc k 

E Sjv J] (A, - EA<) v J J (A< - EA £ ) + Y, £ ( W\ J ) ' fi 5 II ( A ^ - EA ^) ' 
l=i t=i 0cjc[fe] ee.J 

for Ai (defined in the obvious way with respect to n(N)), and e as in Theorem \25l 

To have a more eloquent expression of the mesoscopic convergence expressed by these results would 
certainly be desirable. 

We want finally to point out again that Selberg's approximation to S(t), mentioned in the introduc- 
tion, and therefore Fujii's Theorem's [1] and |6l are true unconditionally. The first of these claims was 
shown by Selberg, using a zero-density estimate to bound the number of zeroes lying off the critical 
line. I have been unable to extend this method to prove Theorem [5] unconditionally, where the points 
we are counting are the imaginary ordinates of non-trivial zeroes, which may in some instances lie off 
the real line, but it may be possible to do so all the same. 
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